Algebraic stability indicators for ranked lists in molecular profiling – Supplementary Material – Rev1
Abstract
1 Permutation groups and distances
1.1 Distances
2 Equivalence of Borda list with best average position list
3 Metrics on partial lists
4 Properties of the harmonic numbers
5 Properties of the Canberra distance
5.1 Proof
5.2 The standard deviation indicator
6 Normalized indicators
7 Feature modules
8 Dataset shaving experiment
8.
Similar resources
Algebraic stability indicators for ranked lists in molecular profiling
MOTIVATION: We propose a method for studying the stability of biomarker lists obtained from functional genomics studies. It is common to adopt resampling methods to tune and evaluate marker-based diagnostic and prognostic systems in order to prevent selection bias. Such caution promotes honest estimation of class prediction, but leads to alternative sets of solutions. In microarray studies, the ...
R2KS: A Novel Measure for Comparing Gene Expression Based on Ranked Gene Lists
Bioinformatics analyses frequently yield results in the form of lists of genes sorted by, for example, sequence similarity to a query sequence or degree of differential expression of a gene upon a change of cellular condition. Comparison of such results may depend strongly on the particular scoring system throughout the entire list, although the crucial information resides in which genes are ra...
Insights into the regulation of human Rev1 for translesion synthesis polymerases revealed by the structural studies on its polymerase-interacting domain.
Dear Editor, Translesion synthesis (TLS) allows the DNA replication machinery to bypass an unrepaired DNA damage site using special polymerases called TLS polymerases (Fischhaber and Friedberg, 2005). When compared with the replicative polymerases, TLS polymerases have comparatively large active sites to incorporate the base opposite the damaged DNA and low fidelity to ensure progression of syn...
Stability and aggregation of ranked gene lists
Ranked gene lists are highly instable in the sense that similar measures of differential gene expression may yield very different rankings, and that a small change of the data set usually affects the obtained gene list considerably. Stability issues have long been under-considered in the literature, but they have grown to a hot topic in the last few years, perhaps as a consequence of the increa...
Algebraic Comparison of Partial Lists in Bioinformatics
The outcome of a functional genomics pipeline is usually a partial list of genomic features, ranked by their relevance in modelling biological phenotype in terms of a classification or regression model. Due to resampling protocols or to a meta-analysis comparison, it is often the case that sets of alternative feature lists (possibly of different lengths) are obtained, instead of just one list. ...